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(54) Title: ALDEHYDE DEHYDROGENASE GENE 



J? (57) Abstract: The present invention relates to a novel DNA which encodes aldehyde dehydrogenase (SNDH), an expression vector 
containing the DNA and recombinant organisms containing said DNA. Furthermore, the present invention concerns a process for 
producing recombinant aldehyde dehydrogenase protein and a process for producing L-ascorbic acid (vitamin C) and/or 2-keto-L-gu- 
Ionic acid (2-KGA) from L-sorbosone by using the recombinant aldehyde dehydrogenase protein or recombinant organisms contain- 
ing the expression vector. Also provided is a process for the production of 2KGA with a microorganism in which the gene encoding 
said aldehyde dehydrogenase is disrupted. 
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Aldeh yde dehydrogenase gene 

The present invention relates to a novel DNA which encodes aldehyde 
dehydrogenase (SNDH) derived from Ghiconobacter oxydans DSM 4025, an expression 
vector containing the DNA and a recombinant microorganism containing the expression 
vector. Furthermore, the present invention concerns a process for producing 
5 recombinant aldehyde dehydrogenase protein and a process for producing L-ascorbic 
acid (vitamin C) and/or 2-keto-L-gulonic acid (2-KGA) from L-sorbosone by using the 
recombinant aldehyde dehydrogenase protein or the recombinant microorganism 
containing said expression vector. 

Vitamin C is one of indispensable nutrient factors for human beings and has been 
10 commercially synthesized by the Reichstein process for about 60 years. Synthetic vitamin 
C is also used in animal feed even though farm animals can synthesize it in their own 
body. Although the Reichstein process has many advantageous points for industrial 
vitamin C production, it still has undesirable problems such as high energy consumption 
and usage of considerable quantities of organic and inorganic solvents. Therefore, over 
15 the past decades, many approaches to manufacture vitamin C using enzymatic 
conversions, which would be more economical as well as ecological, have been 
investigated. 

The present invention is directed to an.isolated nucleic acid molecule encoding 
aldehyde dehydrogenase which comprises a polynucleotide being at least 95% identical to 
20 the nucleotide sequence of SEQ ID NO: 1. 

As used herein, "SNDH" stands for aldehyde dehydrogenase. 

As used herein, "nucleic acid molecule" includes both DNA and RNA and, unless 
otherwise specified, includes both double-stranded, single-stranded nucleic acid, and 
nucleosides thereof. Also included are hybrids such as DNA-RNA hybrids, DNA-RNA- 
25 protein hybrids, RNA-protein hybrids, and DNA-protein hybrids. 

As used herein, "mutation" refers to a single base pair change, insertion or deletion 
in the nucleotide sequence of interest. 



CONFIRMATION COPY 
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As used herein, "mutagenesis" refers to a process whereby a mutation is generated 
in the DNA. With "random" mutagenesis, the exact site of mutation is not predictable, 
occurring anywhere in the chromosome of the microorganism, and the mutation is 
brought about as a result of physical damage caused by agents such as radiation or 
5 chemical treatment. 

As used herein, "promoter" means a DNA sequence generally described as the 5' 
region of a gene, located proximal to the start codon. The transcription of the adjacent 
gene(s) is initiated at the promoter region. If a promoter is an inducible promoter, then 
the rate of transcription increases in response to an inducing agent. In contrast, the rate 
10 of transcription is not regulated by an inducing agent if the promoter is a constitutive 
promoter. 

As used herein, "percent identical" refers to the percent of the nucleotides or amino 
acids of the subject nucleotide or amino acid sequence that have been matched to 
identical nucleotides or amino acids in the compared nucleotide or amino acid sequence 
15 by a sequence analysis program as exemplified below. 

The present invention includes an isolated nucleic acid molecule encoding 
aldehyde dehydrogenase which comprises a polynucleotide being at least 95% identical to 
the polynucleotide selected from the group consisting of (a) nucleotides 258-2084 of SEQ 
ID NO: 1, (b) nucleotides 351-2084 of SEQ ID NO: 1, (c) nucleotides 258-1955 of SEQ 
20 ID NO: 1, and (d) nucleotides 351-1955 of SEQ ID NO: 1. 

It is another aspect of the present invention to provide an isolated nucleic acid 
molecule encoding aldehyde dehydrogenase which comprises a polynucleotide selected 
from the group consisting of (a) a polynucleotide encoding the polypeptide having the 
amino acid sequence of SEQ ID NO: 2, (b) a polynucleotide encoding the polypeptide 

25 consisting of amino acids 32-609 of SEQ ID NO: 2, (c) a polynucleotide encoding the 
polypeptide consisting of amino acids 1-566 of SEQ ID NO: 2, and (d) a polynucleotide 
encoding the polypeptide consisting of amino acids 32-566 of SEQ ID NO: 2. Also 
included are proteins having SNDH activity and which are derived from a protein 
mentioned above by substitution, deletion, insertion or addition of one or more amino 

30 acid(s) in the amino acid sequences mentioned above. 



Functional derivatives as another aspect of the present invention are defined on the 
basis of the amino acid sequences of the present invention by addition, insertion, deletion 
and/or substitution of one or more amino acid residues of such sequences wherein such 
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derivatives still have the SNDH activity measured by an assay known in the art or 
specifically described herein. Such functional derivatives can be made either by chemical 
peptide synthesis known in the art or by recombinant techniques on the basis of the DNA 
sequences as disclosed herein by methods known in the state of the art. Amino acid 



molecules are known in the state of the art* 

In particular embodiments of the present invention, conservative substitutions of 
interest occur as follows: as example substitutions, Ala to Val/Leu/Ile, Arg to 
Lys/Gln/Asn, Asn to Gln/His/Lys/Arg, Asp to Glu, Cys to Ser, Gin to Asn, Glu to Asp, Gly 
to Pro/Ala, His to Asn/Gln/Lys/Arg, He to Leu/Val/Met/Ala/Phe/norLeu, Lys to 
Arg/Gln/Asn, Met to Leu/Phe/Ile, Phe to Leu/Val/Ile/Ala/Tyr, Pro to Ala, Ser to Thr, Thr 
to Ser, Trp to Tyr/Phe, Tyr to Trp/Phe/Thr/Ser, and Val to Ile/Leu/Met/Phe/Ala/norLeu 
are reasonable. As preferred examples, Ala to Val, Arg to Lys, Asn to Gin, Asp to Glu, 
Cys to Ser, Gin to Asn, Glu t Asp, Gly to Ala, His to Arg, lie to Leu, Leu to De, Lys to Arg, 
Met to Leu, Phe to Leu, Pro to Ala, Ser to Thr, Thr to Ser, Trp to Tyr, Tyr to Phe, and Val 
to Leu are reasonable. If such substitutions result in a change in biological activity, then 
more substantial changes, denominated exemplary substitutions described above, are 
introduced and the products screened. 

Furthermore the present invention is directed to polynucleotides encoding 
polypeptides having the SNDH activity as disclosed in the sequence listing as SEQ ID 
NO: 2 as well as the complementary strands, or those which include these sequences, 
DNA sequences or fragments thereof, and DNA sequences, which hybridize under 
standard conditions with such sequences but which encode for polypeptides having 
exactly the same amino acid sequence. 

Thus, the present invention provides an isolated nucleic acid molecule encoding a 
polypeptide having aldehyde dehydrogenase activity, wherein the complement of said 
" nucleic acid molecule hybridizes under standard conditions with a nucleic acid molecule 
as described above. It is an aspect of the invention to provide an isolated nucleic acid 
molecule encoding a polypeptide having aldehyde dehydrogenase activity, wherein said 
nucleic acid molecule hybridizes under standard conditions to the complementary strand 
of a nucleic acid molecule encoding (i) aldehyde dehydrogenase which comprises a 
polynucleotide being at least 95% identical to the nucleotide sequence of SEQ ID NO: 1; 
(ii) aldehyde dehydrogenase which comprises a polynucleotide being at least 95% 
identical to the polynucleotide selected from the group consisting of (a) nucleotides 258- 
2084 of SEQ ID NO: 1, (b) nucleotides 351-2084 of SEQ ID NO: 1, (c) nucleotides 258- 



exchanges in proteins and peptides which do not generally alter the activity of such 
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1955 of SEQ ID NO: 1, and (d) nucleotides 351-1955 of SEQ ID NO: 1; and (iii) aldehyde 



(a) a polynucleotide encoding the polypeptide having the amino acid sequence of SEQ ID 
NO: 2, (b) a polynucleotide encoding the polypeptide consisting of amino acids 32-609 
of SEQ ID NO: 2, (c) a polynucleotide encoding the polypeptide consisting of amino 
acids 1-566 of SEQ ID NO: 2, and (d) a polynucleotide encoding the polypeptide 
consisting of amino acids 32-566 of SEQ ID NO: 2. 

"Standard conditions" for hybridization mean in this context the conditions which 
are generally used by a person skilled in the art to detect specific hybridization signals, or 
preferably so called stringent hybridization conditions used by a person skilled in the art. 

Thus, as used herein, the term "stringent hybridization conditions" means that 
hybridization will occur if there is 95% and preferably at least 97% identity between the 
sequences. Stringent hybridization conditions are, e.g., conditions under over night 
incubation at 42°C using a digoxygenin (DIG)-labeled DNA probe (constructed by using 
a DIG labeling system; Roche Diagnostics GmbH, 68298 Mannheim, Germany) in a 
solution comprising 50% formamide, 5 x SSC (150 mM NaCl, 15 mM trisodium citrate), 
0.2% sodium dodecyl sulfate, 0.1% N-lauroylsarcosine, and 2% blocking reagent (Roche 
Diagnostics GmbH), followed by washing the filters in O.lx SSC at about 60°C. 

This invention is also directed to a recombinant vector, i.e., an expression vector, 
comprising such a nucleic acid molecule as mentioned above. An expression vector of 
the present invention is one which functions in a suitable host cell. Preferred vectors for 
the expression of the nucleic acid molecules of the present invention are vectors or 
derivatives thereof which are selected from the group consisting of pQE, pUC, 
pBluescript II, pACYC177, pACYC184, pVKlOO, and RSF1010. 

A suitable host cell for expression of the nucleotide sequences of the present 
invention is a recombinant microorganism selected from the group consisting of 
bacteria, yeast, and plant cells. Preferably, the microorganism is selected from the group 
consisting of Gluconobacter* Acetobqcter y Pseudomonas, Acinetobacter y Klebsiella and 
Escherichia. An example of such a preferred microorganism is E. coli. A more preferred 
host cell belongs to Gluconobacter oxydans, most preferably G. oxydans DSM 4025 (FERM 
BP-3812), which had been deposited on March 17, 1987 under the conditions of the 
Budapest Treaty at the Deutsche Sammlung von Mikroorganismen und Zellkulturen 
GmbH, Braunschweig, Germany. 



dehydrogenase which comprises a polynucleotide selected from the group consisting of 
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The microorganism "Gluconobacter oxydans" also includes synonyms or basonyms 
of such species having the same physico-chemical properties, as defined by the 
International Code of Nomenclature of Prokaryotes. 

Thus, the present invention is directed to a recombinant microorganism which is 
transformed with the expression vector as described above or which comprises a nucleic 
acid molecule as described above integrated into its chromosomal DNA. 

A wide variety of host/vector combinations maybe used for cloning the double 
stranded nucleotide sequences of the present invention. As £. colt is a preferred host cell, 
any vectors usually used in R coli are useful for the present invention. Such vectors 
include, but are not limited to, pQE vectors which can express His-tagged recombinant 
proteins (QIAGEN K.K., Tokyo, Japan), pBR322 or its derivatives including pUC18 and 
pBluescript II (Stratagene Cloning Systems, California, USA), pACYC177 and 
pACYC184 and their derivatives, and a vector derived from a broad host range plasmid 
such as RK2 and RSF1010. Thus, the expression vector used in the present invention is 
derived from pQE-plasmids, pUC-plasmids, pBluescript II, pACYC177, pACYC184, and 
their derivative plasmids, and a broad host range plasmid such as pVKlOO and RSF1010. 

As used herein, "expression vector" means a cloning vector which is capable of 
enhancing the expression of a gene that has been cloned into it, after transformation into 
a suitable host. The cloned gene is usually placed under the control of (i.e., operably 
linked to) certain control sequences such as promoter sequences. Promoter sequences 
maybe either constitutive or inducible. 

As used herein, "cloning vector" means a plasmid or phage DNA or other DNA 
sequence which is able to replicate autonomously in a host cell, and which is 
characterized by a single or a small number of restriction endonuclease recognition sites 
at which such DNA sequences maybe cut in a determinable fashion without loss of an 
essential biological function of the vector, and into which a DNA fragment may be 
introduced in order to bring about its replication and cloning. The cloning vector may 
further contain a marker suitable for use in the identification of cells transformed with 
the cloning vector. Such markers provide, e.g., tetracycline or ampicillin resistance. 



As used herein, a "recombinant vector" includes any cloning or expression vector 
which contains the desired cloned gene(s). 
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As used herein, "expression" refers to the process by which a polypeptide is 
produced from a structural gene. The process involves transcription of the gene into 
mRNA and the translation of such mRNA into polypeptide(s). 

As used herein, "recombinant microorganism" includes a recombinant host which 
may be any prokaryotic or eukaryotic cell that contains the desired cloned gene(s) on an 
expression or cloning vector. This term also includes those prokaryotic or eukaryotic 
cells that have been genetically engineered to contain the desired gene(s) in the 
chromosome or genome of said microorganism. 

As used herein, "host" includes any prokaryotic or eukaryotic cell that is the 
recipient of a replicable expression vector or cloning vector. A "host", as the term is used 
herein, also includes prokaryotic or eukaryotic cells that can be genetically engineered by 
well known techniques to contain desired gene(s) on its chromosome or genome. 
Examples of such hosts are known to the skilled artisan. 

To construct a recombinant microorganism carrying recombinant DNA, e.g., a 
recombinant vector, various gene transfer methods including, but not limited to, 
transformation, transduction, conjugal mating or electroporation can be used. These 
methods are well-known in the field of molecular biology. Conventional transformation 
systems can be used for Gluconobacter, Acetobacter, Pseudomonas, Acinetobacter, Klebsiella 
or Escherichia. A transduction system can also be used for E. coli. Conjugal mating 
systems can be widely used in Gram-positive and Gram-negative bacteria including E. 
coli, P. putida, and Gluconobacter. An example of conjugal mating is disclosed in WO 
89/06,688. The conjugation can occur in liquid medium or on a solid surface. Examples 
for a suitable recipient for SNDH production include microorganisms of Gluconobacter 
Acetobacter, Pseudomonas, Acinetobacter, Klebsiella or Escherichia. To the recipient for 
conjugal mating, a selective marker maybe added, e.g., resistance against nalidixic acid or 
rifampicin. Natural resistance can also be used, e.g., resistance against polymyxin B is 
useful for many Gluconobacter s. 

Preferred vectors useful for the present invention are broad-host-range vectors 
such as a cosmid vector like pVKlOO and its derivatives and RSF1010. Copy number and 
stability of the vector should be carefully considered for stable and efficient expression of 
the cloned nucleic acid molecules and also for efficient cultivation of the host cell 
carrying said cloned molecules . Nucleic acid molecules containing transposable 
elements such as Tn5 can also be used to introduce the desired DNAs into the preferred 
host, especially on a chromosome. Nucleic acid molecules containing any DNAs isolated 
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from the preferred host together with the nucleotide sequences of the present invention 
are also useful to introduce the nucleotide sequences of the present invention into the 
preferred host cell, especially on a chromosome. Such nucleic acid molecules can be 
transferred to the preferred host by applying any of a conventional method, e.g., 
transformation, transduction, conjugal mating or electroporation, which are well known 
in the art, considering the nature of the host cell and the nucleic acid molecule. 

The nucleotide sequences including the SNDH gene provided in this invention are 
ligated into a suitable vector containing a regulatory region such as a promoter, a 
ribosomal binding site, and a transcriptional terminator operable in the host cell 
described above with a method well-known in the art to produce a suitable expression 
vector. 

To express the desired gene/nucleotide sequence isolated from G. oxydans DSM 
4025 efficiently, various promoters can be used; e.g., the original promoter of the gene, 
promoters of antibiotic resistance genes such as kanamycin resistant gene of Tn5, 
ampicillin resistant gene of pBR322, and beta-galactosidase of E. coli (lac), trp-, tac-, trc- 
promoter, promoters of lambda phage and any promoters which are functional in a host 
cell. For this purpose, the host cell can be selected from the group consisting of bacteria, 
yeast, and plant cells. Preferably, the host cell belongs to the genera Ghiconobacter, 
Acetobacter, Pseudomonas, Acinetobacter y Klebsiella or Escherichia. 

For expression, other regulatory elements, such as a Shine-Dalgarno (SD) sequence 
(e.g., AGGAGG, including natural and synthetic sequences operable in the host cell) and 
a transcriptional terminator (inverted repeat structure including any natural and syn- 
thetic sequence operable in the host cell) which are operable in the host cell (into which 
the coding sequence will be introduced to provide a recombinant cell of this invention) 
can be used with the above described promoters. 

For the expression of polypeptides which are located in the periplasmic space, like 
the SNDH protein of the present invention, a signal peptide, which contains usually 15 to 
50 amino acid residues and is totally hydrophobic, is preferably associated. A DNA 
encoding a signal peptide can be selected from any natural and synthetic sequence 
operable in the desired host cell. A putative signal peptide containing amino acid 
residues 1-31 of SEQ ID NO: 2 was also found in the protein expressed by the SNDH 
gene of the present invention (SEQ ID NO: 4). 
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Unless otherwise mentioned, all amino acid sequences determined by sequencing 
the purified SNDH protein herein were determined using an automated amino acid 
sequencer (such as model 470A, Perkin- Elmer Applied Biosystems). 

Unless otherwise indicated, all nucleotide sequences determined by sequencing a 
DNA molecule herein were determined using an automated DNA sequencer (such as the 
model ALF express II, Amersham Pharmacia Biotech), and all amino acid sequences of 
polypeptides encoded by DNA molecules determined herein were predicted by 
translation of the DNA sequence determined as above. Therefore, as it is known in the 
art for any DNA sequence determined by this automated approach, any nucleotide 
sequence determined herein may contain some errors. Nucleotide sequences determined 
by automation are typically at least about 90% identical, more typically at least about 
95% to at least about 99.9% identical to the actual nucleotide sequence of the sequenced 
DNA molecule. The actual sequence can be more precisely determined by other 
approaches including manual DNA sequencing methods well known in the art. As is also 
known in the art, a single insertion or deletion in a determined nucleotide sequence 
compared to the actual sequence will cause a frame shift in translation of the nucleotide 
sequence such that the predicted amino acid sequence encoded by a determined 
nucleotide sequence will be completely different from the amino acid sequence actually 
encoded by the sequenced DNA molecule, beginning at the point of such an insertion or 
deletion. 

The invention provides an isolated nucleic acid molecule encoding the enzyme 
(SNDH). Methods and techniques designed for the manipulation of isolated nucleic acid 
molecules are well known in the art. Methods for the isolation, purification, and cloning 
of nucleic acid molecules, as well as methods and techniques describing the use of 
eukaryotic and prokaryotic host cells and nucleic acid and protein expression therein, are 
known to the skilled person. 

Briefly, the SNDH gene, the DNA molecule containing said gene, the recombinant 
expression vector and the recombinant microorganism used in the present invention can 
be obtained by the following steps: , 

(1) isolating chromosomal DNA from G. oxydans DSM 4025 and constructing a gene 
library with the chromosomal DNA in an appropriate host cell, e.g., E. coli\ 

(2) cloning the SNDH gene from the chromosomal DNA by colony-, plaque-, or 
Southern-hybridization, PCR (polymerase chain reaction) cloning, western-blot analysis 
or other techniques known in the art; 
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(3) determining the nucleotide sequence of the SNDH gene obtained as above by 
conventional methods to select a DNA molecule containing said SNDH gene and 
constructing a recombinant expression vector on which the SNDH gene can be expressed 
efficiently, 

(4) constructing recombinant microorganisms carrying the SNDH gene by an 
appropriate method for introducing DNA into a host cell, e.g., transformation, 
transduction, conjugal transfer and/or electroporation, which host cell thereby becomes a 
recombinant microorganism of this invention. 

The materials and techniques used in the above aspect of the present invention are 
exemplified in detail as follows: 

A total chromosomal DNA can be purified by a procedure well known in the art. 
The desired gene can be cloned in either plasmid or phage vectors from a total 
chromosomal DNA typically by either of the following illustrative methods: 

(i) The partial amino acid sequences are determined from the purified proteins or 
peptide fragments thereof. Such whole protein or peptide fragments can be prepared by 
the isolation of such a whole protein or by peptidase- treatment from the gel after SDS- 
polyacryiamide gel electrophoresis. Thus obtained protein or fragments thereof are 
applied to protein sequencer such as Applied Biosystems automatic gas-phase sequencer 
470A. The amino acid sequences can be utilized to design and prepare oligonucleotide 
probes and/or primers with DNA synthesizer such as Applied Biosystems automatic 
DNA sequencer 381 A. The probes can be used for isolating clones carrying the target 
gene from a gene library of the strain carrying the target gene by means of Southern-, 
colony- or plaque-hybridization. 

(ii) Alternatively, for the purpose of selecting clones expressing a target protein from the 
gene library, immunological methods with antibodies prepared against the target protein 
can be applied. 

(iii) The DNA fragment of the target gene can be amplified from the total chromosomal 
DNA by PCR method with a set of primers, i.e., two oligonucleotides synthesized 
according to the amino acid sequences determined as above. Then a clone carrying the 

30 target- whole gene can be isolated from the gene library constructed, e.g., in E. coli> by 
Southern-, colony-, or plaque-hybridization with the PCR product obtained above as 



probe. 



35 



DNA sequences which can be made by PCR by using primers designed on the basis 
of the DNA sequences disclosed herein by methods known in the art are also an object of 
the present invention. 
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Above mentioned antibodies can be prepared with the purified SNDH proteins, the 
purified recombinant SNDH proteins such as His-tagged SNDH expressed in E. coli> or 
its peptide fragment as an antigen. 

Once a clone carrying the desired gene is obtained, the nucleotide sequence of the 
target gene can be determined by a well known method such as dideoxy chain 
termination method with M13 phage. 

Brief description of the Figures: 

Figure 1 illustrates the gene encoding a protein having aldehyde dehydrogenase 
activity of the present invention. The restriction map of the SNDH gene and ORF-A 
gene is given, wherein ORF means open reading frame and Signal seq. means the putative 
signal peptide sequence of the SNDH gene. 

Figure 2 illustrates the restriction map of the SNDH and ORF-A genes cloned into 
cosmid pVSN5 as well as cloning of insert DNA of different sizes into pUC plasmids 
pUCSNP4, pUCSNP9, pUCSN19, and pUCSNS. In the physical map of pVSN5, the 
arrow filled in gray shows the SNDH gene. 

Figure 3 shows the cloning strategy of the 8.0 kb Pstl fragment including the intact 
SNDH gene from pVSN5 into pUC18 vectors to result in pUCSNP4 and pUCSNP9. 
Note that pUCSNP4 and pUCSNP9 are identical except for the direction of the insert. 

Figure 4 shows schematically the construction of the GOMTRlSN::Km (SNDH- 
disruptant) by using a suicide vector plasmid having the disrupted SNDH gene with 
kanamycin cassette (Km). Homologous recombination between the vector plasmid and 
the chromosomal DNA of GOMTR1 as the parent strain at the corresponding region 
occurs to obtain the disruptant strains. "G.O." means Ghiconobacter oxydans. 

Figure 5 represents the physical map of the insert DNA of pVSN117, pVSN106, and 
pVSN114. Plasmid pVSN117 has the insert DNA encoding the C- terminal deleted 
SNDH gene (nucleotides 258-1955 of SEQ ID NO: 1, i.e., amino acids 1-566 of SEQ ID 
NO: 2), which expresses only a 55 kba protein. Plasmids pVSN106 and pVSN114 have 
the insert DNA encoding the intact SNDH gene. 

The gene of the present invention encodes an SNDH enzyme of 578 amino acid 
residues (SEQ ID NO: 5 consisting of amino acids 32-609 of SEQ ID NO: 2) together 
with a putative signal peptide of 31 amino acid residues (SEQ ID NO: 4 consisting of 
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amino adds 1-31 of SEQ ID NO: 2) as depicted in Figure 2. In terms of nucleotide 
sequences, the coding region of the SNDH gene encompasses nucleotides 258-2087 of 
SEQ ID NO: 1 and includes the coding sequences for a putative signal peptide 
(nucleotides 258-350 of SEQ ID NO: 1) and the stop codon (nucleotides 2085-2087 of 
SEQ ID NO: 1). Thus, the nucleotide sequence without the stop codon is the nucleotide 
sequence from position 258-2084 of SEQ ID NO: 1, and additionally without the signal 
sequence the nucleotide sequence encompasses nucleotides 351-2084 of SEQ ID NO: 1. 

The nucleic acid molecules as disclosed in the present invention can be used in a 
recombinant microorganism for the production of 2-KGA and/or vitamin C. The 
recombinant microorganism, which is selected from Gluconobacter, Acetobacter, 
Pseudomonas, Acinetobacter, Klebsiella and Escherichia, may be cultured in an aqueous 
medium supplemented with appropriate nutrients under aerobic conditions. The 
cultivation may be conducted at a pH of 4.0 to 9.0, preferably 6.0 to 8.0. The cultivation 
period varies depending on the pH, temperature and nutrient medium to be used, and is 
preferably about 1 to 5 days. The preferred temperature range for carrying out the 
cultivation is from about 13°C to about 36°C, preferably from about 18°C to about 33°C. 
It is usually required that the culture medium contains such nutrients as assimilable 
carbon sources, e.g., glycerol, D-mannitol, D-sorbitol, erythritol, ribitol, xylitol, arabitol, 
inositol, dulcitol, D-ribose, D-fructose, D-glucose, and sucrose, preferably D-sorbitol, D- 
mannitol, and glycerol; and digestible nitrogen sources such as organic substances, e.g., 
peptone, yeast extract, baker ! s yeast, urea, amino acids, and corn steep liquor. Various 
inorganic substances may also be used as nitrogen sources, e.g., nitrates and ammonium 
salts. Furthermore, the culture medium usually contains inorganic salts, e.g., magnesium 
sulfate, potassium phosphate, and calcium carbonate. The cultivation is carried out in 
appropriate equipment such as jar fermentors, flasks, or tubes. The recombinant 
microorganism is either transformed with an expression vector containing the nucleic 
acid molecules as above or comprises the nucleic acid molecule of the present invention 
integrated into its chromosomal DNA. 

An appropriate sugar compound used as substrate for the production of 2-KGA 
and/or vitamin C is L-sorbosone. The metabolic pathway for 2-KGA and vitamin C goes 
from D-sorbitol via L-sorbose to L-sorbosone, which is then converted to 2-KGA and/or 
vitamin C. Thus, the direct substrate for both products is L-sorbosone. 

Thus, it is an aspect of the present invention to provide a process for the 
production of 2-KGA and/or vitamin C from L-sorbosone comprising (a) propagating or 
» cultivating the recombinant microorganism, which is either transformed with an 
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the present invention or which comprises the nucleic acid molecules of the present 
invention integrated into its chromosomal DNA, in an appropriate culture medium and 
(b) recovering and separating 2-KGA and/or vitamin C from said culture medium. 

It is one embodiment to provide a process for the production of vitamin C and/or 
2-KGA from L-sorbosone comprising (a) propagating a recombinant organism in an 
appropriate culture medium, wherein the nucleic acid molecule as of the present 
invention is heterologously introduced to said recombinant organism, and (b) recovering 
and separating vitamin C and/or 2-KGA from said culture medium. 

The present invention provides recombinant SNDH. One can increase the 
production yield of the SNDH enzyme by introducing the SNDH gene provided by the 
present invention into a host cell including G. oxydans DSM 4025. One can also produce 
more efficiently the SNDH proteins in a host cell selected from a group consisting of 
Gluconobacter, Acetobacter, Pseiidomonas, Acinetobacter, Klebsiella and Escherichia by 
using the SNDH gene of the present invention. The microorganism maybe cultured as 
described above. 

An embodiment for the isolation and purification of the recombinant SNDH from 
the microorganism after the cultivation is briefly described hereinafter: cells are harvested 
from the liquid culture broth by centrifiigation or filtration. The harvested cells are 
washed with water, physiological saline or a buffer solution having an appropriate pH. 
The washed cells are suspended in the buffer solution and disrupted by means of a 
homogenizer, sonicator or French press, or by treatment with lysozyme to give a solution 
of disrupted cells. The recombinant SNDH is isolated and purified from the cell-free 
extract of disrupted cells, preferably from the cytosol fraction of the microorganism. The 
recombinant SNDH can be immobilized on a solid carrier for solid phase enzyme 
reaction. 

The invention is further directed to a process for the production of 2-KGA from L- 
sorbosone comprising (a) cultivating a microorganism belonging to Gluconobacter 
oxydans DSM 4025 in an appropriate culture medium, wherein the gene encoding 
aldehyde dehydrogenase represented by SEQ ID NO: 2 is disrupted in said 
microorganism, and (b) recovering and separating 2-KGA from said culture medium. 
The disruption may take place anywhere in the gene, resulting in a non-functioning of 
the encoded enzyme. 
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Thus, a process is provided for the production of 2-KGA via L-sorbosone from an 
appropriate sugar compound comprising (a) propagating a microorganism belonging to 
Gluconobacter oxydans DSM 4025 in an appropriate culture medium, wherein the gene 
encoding aldehyde dehydrogenase is disrupted in said microorganism, said aldehyde 
dehydrogenase being encoded by (i) a polynucleotide being at least 95% identical to the 
nucleotide sequence of SEQ ID NO: 1; (ii) a polynucleotide being at least 95% identical 
to the polynucleotide selected from the group consisting of (a) nucleotides 258-2084 of 
SEQ ID NO: 1, (b) nucleotides 351-2084 of SEQ ID NO: 1, (c) nucleotides 258-1955 of 
SEQ ID NO: 1, and (d) nucleotides 351-1955 of SEQ ID NO: 1; and (iii) a polynucleotide 
selected from the group consisting of (a) a polynucleotide encoding the polypeptide 
having the amino acid sequence of SEQ ID NO: 2, (b) a polynucleotide encoding the 
polypeptide consisting of amino acids 32-609 of SEQ ID NO: 2, (c) a polynucleotide 
encoding the polypeptide consisting of amino acids 1-566 of SEQ ID NO: 2, and (d) a 
polynucleotide encoding the polypeptide consisting of amino acids 32-566 of SEQ ID 
NO: 2. The resulting 2-KGA is further recovered and isolated from said culture medium. 

In one embodiment, the invention provides a process for the disruption of the 
SNDH gene by classical mutagenesis with agents such as UV-irradiation or chemical 
treatment by any mutation reagents, e.g., N-methyl-N'-nitro-N-nitrosoguanidine (NTG), 
ICR170 or acrydine orange, in vivo as well as in vitro. 

In another embodiment, the invention provides a process for the disruption of the 
SNDH gene by DNA recombination techniques such as transposon insertion or site 
directed mutagenesis by PCR, in vivo as well as in vitro. 

In another embodiment, the invention provides a process for producing 2-KGA 
using the disruptants described above by fermentation from an appropriate substrate, 
i.e., a sugar compound, which is selected from the group consisting of L-sorbosone, D- 
glucose, D-sorbitol, and L-sorbose. The process takes place in appropriate equipment 
such as jar fermentors, flasks, or tubes. Furthermore, the invention provides a process 
for producing 2-KGA using a cell free extract of the disruptants described above by 
incubation from an appropriate substrate, e.g., L-sorbosone, D-glucose, L-sorbose, and 
D-sorbitol, in appropriate equipment such as a bioreactor. 

The present invention provides recombinant SNDH. Furthermore, it is directed to 
a process for the production of aldehyde dehydrogenase comprising (a) cultivating a 
recombinant microorganism comprising a nucleic acid molecule encoding aldehyde 
dehydrogenase which comprises (i) a polynucleotide being at least 95% identical to the 
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nucleotide sequence of SEQ ID NO: 1; (ii) a polynucleotide being at least 95% identical 
to the polynucleotide selected from the group consisting of (a) nucleotides 258-2084 of 
SEQ ID NO: 1, (b) nucleotides 351-2084 of SEQ ID NO: 1, (c) nucleotides 258-1955 of 
SEQ ID NO: 1, and (d) nucleotides 351-1955 of SEQ ID NO: 1; and (iii) a polynucleotide 

5 selected from the group consisting of (a) a polynucleotide encoding the polypeptide 
having the amino acid sequence of SEQ ID NO: 2, (b) a polynucleotide encoding the 
polypeptide consisting of amino acids 32-609 of SEQ ID NO: 2, (c) a polynucleotide 
encoding the polypeptide consisting of amino acids 1-566 of SEQ ID NO: 2, and (d) a 
polynucleotide encoding the polypeptide consisting of amino acids 32-566 of SEQ ID 

10 NO: 2; wherein said microorganism is cultivated in an appropriate culture medium and 
(b) wherein said aldehyde dehydrogenase is recovered and separated from said culture 
medium. 



Example 1 : Amino acid sequencing from the N- terminus of SNDH 

15 The partial amino acid sequence of the N-terminal 75 kDa subunit of the SNDH 

protein was determined. About 10 \kg of the SDS-treated purified SNDH consisting of 75 
kDa subunits was subjected to SDS-PAGE, and the protein band was electroblotted onto 
a PVDF membrane. The protein blotted on the membrane was soaked in a digestion 
buffer (100 mM potassium phosphate buffer, 5 mM dithiothreitol, 10 mM EDTA, pH 

20 8.0) and incubated with 5.04 )Xg of pyroglutamate aminopeptidase (SIGMA, USA) at 
30°C for 24 hours. After incubation, the membrane was washed with deionized water 
and subjected to N-terminal amino acid sequencing using an automated amino acid 
sequencer (ABI model 490, Perkin Elmer Corp., Conn., USA). As a result, 14 residues of 
the N-terminal amino acid sequence were obtained as illustrated in SEQ ID NO: 3. 

25 

Example 2: Cloning of partial SNDH gene by PCR 

Amplification of the partial SNDH gene fragment was carried out by PCR with 
chromosomal DNA of G. oxydans DSM 4025 (FERM BP-3812) and degenerated 
oligonucleotide DNA primers, Pll (SEQ ID NO: 6) and P12 (SEQ ID NO: 7). Both 
30 primers were degenerated DNA mixtures having bias for Gluconobacter codon usage. 
The PCR was performed with thermostable taq polymerase (TAKARA Ex Taq™, Takara 
Shuzo Co., Ltd., Seta 3-4-1, Otsu, Shiga, 520-2193, Japan), using a thermal cycler (Gene 
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10 



15 



20 



25 



Amp PCR System 2400-R, PE Biosystems, 850 Lincoln Centre Drive, Foster City, CA 
94404, USA). The reaction mixture (25 (il) consisted of 200 |JlM of dNTPs, 50 pmol of 
each primer (24 ~ 48 degeneracy), 5 ng of the chromosomal DNA, and 1.25 units of the 
DNA polymerase in the buffer provided from the supplier. The reaction was carried out 
with 5 cycles of denaturation step at 94°C for 30 sec, annealing step at 37°C for 30 sec, 
synthesis step at 70°C for 1 min followed by 25 cycles of denaturation step at 94°C for 30 
sec, annealing step at 50°C for 30 sec, synthesis step at 70°C for 1 min. As a result, a 41 
bp DNA fragment was specifically amplified and cloned into vector pCR 2.1-TOPO 
(Invitrogen, 1600 Faraday Avenue Carlsbad, California 92008, USA) to obtain a 
recombinant plasmid pMTSN2. The nucleotide sequence of the cloned 41 bp DNA 
fragment which encodes an N-terminal partial amino acid sequence of the mature SNDH 
protein was confirmed by dideoxy- chain termination method (F. Sanger et al, Proc. Natl. 
Acad. Sci. USA, 74, 5463-5467, 1977). 

Example 3: Complete cloning of the SNDH gene 

(1) Construction of a gene library of G. oxydans DSM 4025 

The chromosomal DNA of G. oxydans DSM 4025 was prepared from cells grown 
on M agar medium containing 5% D-mannitol, 1.75% corn steep liquor, 5% baker f s 
yeast, 0.25% MgS0 4 *7H 2 0, 0.5% CaC0 3 (Practical Grade), 0.5% urea, and 2.0% agar 
(pH 7.0), for 4 days at 27°C. The chromosomal DNA (4 pig) was partially digested with 4 
units of EcoR I in 20 jLtl of reaction mixture. A portion (8 pi) of the sample containing 
partially-digested DNA fragments was separated by an electrophoresis using 1% agarose 
gel. Fragments ranging from 15 to 35 kb were cut out and chemically melted to recover 
the fragments using QIAEX II (QIAGEN Inc., 28159 Avenue Stanford, Valencia, CA 
91355, USA). The objective DNA fragments recovered were suspended in H 2 0. On the 
other hand, 2 pig of a cosmid vector pVKlOO was completely digested with EcoR I and 
dephosphorylated of the 5'-ends by treating with bacterial alkaline phosphatase (E .coli 
C75) (Takara Shuzo). The treated pVKlOO (220 ng) was ligated with the 15 - 35 kb EcoR 
I fragments (1 \xg) using a ligation kit (Takara Shuzo) in 36 pi of reaction mixture. The 
ligated DNA which had been ethanol precipitated and resolved in appropriate volume of 
TE buffer (10 mM Tris-HCl pH 8.0, 1 mM EDTA) was used for in vitro packaging 
(Gigapack III Gold Packaging Extract, Stratagene, 11011 North Torrey Pines Road, La 
Jolla, CA 92037, USA) to infect E. coli VCS257, a host strain for the genomic library. As a 
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result, totally 400,000 - 670,000 clones containing about 25 kb-inserted DNA fragments 
were obtained. 

(2) Complete cloning of the SNDH gene by colony hybridization 

The probe that would be used for screening of the cosmid library described above 
5 to detect clones carrying the complete SNDH gene by colony hybridization method, was 
constructed. The 41 bp DNA fragment encoding the N-terminal amino acid sequence of 
SNDH was amplified and labeled by PCR-DIG labeling method (Roche Molecular 
Systems Inc., 1145 Atlantic Avenue, Alabama, CA94501, USA). PCR with plasmid 
pMTSN2 DNA as a template and oligonucleotide DNA primers, P13 (SEQ ID NO: 8) 

10 and P14 (SEQ ID NO: 9), was performed with thermostable taq polymerase (TAKARA 
Ex Taq™, Takara Shuzo Co., Ltd.), using a thermal cycler (Gene Amp PCR System 2400- 
R, PE Biosystems). The reaction was carried out with 25 cycles of denaturation step at 
94°C for 30 sec, annealing step at 55°C for 30 sec, synthesis step at 70°C for 1 min. Using 
the DIG-labeled probe, screening of the cosmid library (about 1,000 clones) by colony 

15 hybridization and chemiluminescent detection according to the method provided from 
the supplier (Roche Molecular Systems Inc., USA) was carried out As a result, 3 positive 
clones were isolated and one of them was designated pVSN5, which carried about 25 kb 
insert DNA in pVKlOO vector. From this, 25 kb DNA insert fragments of different sizes 
were further subcloned into pUCl8 vectors (Figure 2): (1) a 3.2 kb EcoR I fragment 

20 comprising the upstream portion (the N-terminal part) of the SNDH gene resulting in 
pUCSN19, (2) a 7.2 kb EcoR I fragment comprising the downstream portion (the C- 
terminal part) of the SNDH gene resulting in pUCSN5, and (3) a 1.8 kb Pst I fragment 
comprising the intact or complete SNDH gene resulting in pUCSNP4 and pUCSNP9, 
respectively. Note that the inserts in pUSNP4 and pUSNP9 are the same but in opposite 

25 directions. 

(3) Nucleotide sequencing of the SNDH gene 

Plasmids pUCSNl9, pUCSN5, and pUCSNP4 were used for nucleotide sequencing 
of a region including the SNDH gene or gene fragments. The determined nucleotide 
sequence (SEQ ID NO: 1 with 3,408 bp) revealed that the ORF of the SNDH gene (1,827 
30 bp; nucleotides 258-2084 of SEQ ID NO: 1) encoded a polypeptide of 609 amino acid 
residues (SEQ ID NO: 2). An additional ORF, ORF-A, was found downstream of the 
SNDH ORF as illustrated in Figure 1. ORF-A (1,101 bp; nucleotides 2214-3314 of SEQ 
ID NO: 1) encoded a polypeptide of 367 amino acids. 
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In the ORF of the SNDH gene, a signal peptide-like sequence (SEQ ID NO: 4 with 
31 amino acids), is possibly included in the deduced amino acid sequence, which contains 
(i) many hydrophobic residues, (ii) positively-charged residues close to the N-terminus 
and (iii) Ala-Xaa-Ala motif for cleavage site of the signal sequence. The putative 
ribosome-binding site (Shine-Dalgarno, SD, sequence) for the SNDH gene was located at 
6 bp upstream of the initiation codon (AGGAGA at nucleotide position 247-252 of SEQ 
ID NO: 1). Furthermore, a motif (Cys-Xaa-Xaa-Cys-His) defined as heme c binding site 
was found at position 530-534 of SEQ ID NO: 2. From the genetically analysis as shown 
above, the SNDH protein is thought to be one of quinohemoproteins. 

A homology search for the SNDH gene using the program of FASTA in GCG 
(Genetics Computer Group, Madison, WI, USA) revealed that Arg227, Asn228, Gln230, 
Gly246, and Asp251 of SEQ ID NO: 5 correspond to several highly conserved residues in 
the presumed active site of A. calcoaceticus GDH-B protein described by Oubrie et al. [J. 
MoL Biol. 289:319-333 (1999)]. 

Example 4: Expression of the SNDH gene in E. colt 

Plasmids pUCSNP4 and pUCSNP9 (Figure 3), containing the 8.0 kb Pst I-fragment 
with the intact, i.e., complete SNDH gene, were transformed into E. colt JM109 to 
confirm the expression and the activity of the SNDH proteins. The amount of vitamin C 
produced as the enzyme activity was measured at a wavelength of 264nm by a high 
performance liquid chromatography system (HPLC) which was composed with a UV 
detector (TOSOH UV8000; Tosoh, Japan), a dualpump (TOSOH CCPE; Tosoh), an 
integrator (Shimadzu C-R6A; Shimadzu, Japan) and a column (YMC-Pack Polyamine-II; 
4.6 mm of inner diameter [i.d.] X 15 cm, YMC, U.S.A.). 

The conversion activity of L-sorbosone to vitamin C by using cytosol fraction of 
the recombinant E. colt was tested (Table 1). Cells were cultivated in LB medium 
optionally supplemented with 10 |nM of PQQ and 1.0 mM of CaCl 2 . The cytosol fraction 
was prepared by ultracentrifugation (100,000 x g y 45 min) of the cell free extract in 50 
mM potassium phosphate buffer (pH 7.0). The reaction mixture (100 fxl) consisted of 
125 jxg of cytosol fraction of the recombinant E. coli, 50 mM of L-sorbosone, 1.0 mM of 
phenazine mesosulfate (PMS), with or without the addition of 1.0 JJ.M of PQQ and 1.0 
mM of CaCl 2 as co factors, depending on the case. The enzyme reaction was carried out 
at 30°C for 30 min. The holo-SNDHs of the cells cultivated in LB medium containing 10 
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|xM of PQQ and 1.0 mM of CaCl 2 produced vitamin C definitely under the defined 
reaction condition without the cofactors of PQQ and CaCl 2 . By addition of the cofactors, 
the apo-SNDH expressed with pUCSNP4 and pUCSNP9 showed almost the same activity 
as those of the holo-enzyme. 



Table 1: activity measurement of recombinant SNDH 



Microorganism 


PQQ and CaCl 2 
in the medium 


Specific activity (mU/mg Protein) 


with 
PQQ and CaCl 2 


without 
PQQ and CaCl 2 


E. coli JM109/pUCSNP4 


+ 


0.187 


0.224 


E. coKJM109/pUCSNP9 


+ 


0.198 


0.252 


E. coli JM109/pUC18 


+ 


0.000 


0.000 


Kcoli JM109/pUCSNP4 




0.155 


0.000 


Rcoli JM109/pUCSNP9 




0.176 


0.000 


Kcoli JM109/pUC18 




0.000 


0.000 


G. oxydans DSM 4025 




0.026 


0.026 



One unit (U) of the enzyme was defined as the amount of enzyme, which produces 1.0 
mg of vitamin C in the defined reaction. 



10 Example 5: Construction and cultivation of SNDH-gene disruptants of G. oxydans 
strains 

Figure 4 shows the scheme for the construction of SNDH gene targeting vector, 
GOMTRlSN::Km (SNDH-disruptant). First, plasmid pSUPSN was constructed by a 
ligation of the 8.0 kb Pst I fragment containing the SNDH gene from plasmid pUCSNP4 

15 with a suicide vector pSUP202 (for reference see Simon et al., A broad host range 

mobilization system for in vitro genetic engineering: transposon mutagenesis in Gram 
negative bacteria, Biotechnology, 1, 784-791, 1983). Second, a kanamycin-resistant-gene 
cassette (Km cassette) was inserted into the EcoR I site of the SNDH gene cloned in 
plasmid pSUPSN to obtain plasmid pSUPSN::Km (Km r Tc r ). Then, plasmid 

20 pSUPSN::Km was introduced into GOMTRl, which was a rifampicin (Rif) resistant 
derived spontaneously from wildtype G. oxydans DSM 4025 strain, to result in SNDH- 
null mutants (Km r Rif Tc s ) . 
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GOMTR1 was cultivated in a 200 ml flask containing 50 ml of T broth, which was 



Cockeysville, MD 21030, USA) and 3 g/1 of yeast extract (Difco; Becton Dickinson 
Microbiology Systems, Becton Dickinson and Company, Sparks, MD 21152, USA) with 
100 jig/ml of rifampicin at 30°C overnight. R coli HB101 (pRK2013) [D. H. Figurski, 



cultivated in test tubes containing 2 ml of LB medium with 50 Hg/ml of kanamycin at 
30°C overnight. Cultured cells of GOMTRl, E. coli HB101 (pRK2013), and R coli JM109 
(pSUPSN::Km) were collected separately by centrifugation and each cell suspension in LB 
medium was mixed in the ratio of 10:1:1, respectively. Then these cell suspensions were 
mixed at the same volume and the mixture was spread out on a 0.45 \im nitrocellulose 
membrane (PROTRAN, Schleicher & Schuell GmbH, Postfach 4, D-37582 Dassel, 
Germany) put on an agar medium, which was composed of 5.0% mannitol, 0.25% 
MgS0 4 *7H 2 0, 1.75% corn steep liquor, 5.0% baker's yeast, 0.5% urea, 0.5% CaC0 3 , and 
2.0% agar, for conjugal transfer of the suicide plasmid from the £. coli donor to 
GOMTRl as recipient. After cultivation at 27°C for 1 day, the cells containing 
transconjugants were suspended and diluted appropriately with T broth, and spread out 
on the screening agar plates containing 100 jxg/ml of rifampicin and 50 Hg/ml of 
kanamycin. Finally, several transconjugants (Km r Rif Tc s ) which had the disrupted 
SNDH gene with Km cassette were obtained. 

GOMTRl and the disruptants, GOMTRlSN::Km, were grown on an agar plate 
containing 8.0% L-sorbose, 0.25% MgS0 4 *7H 2 0, 1.75% corn steep liquor, 5.0% baker's 
yeast, 0.5% urea, 0.5% CaC0 3 ; and 2.0% agar at 27°C for 4 days. One loopful of the cells 
was inoculated into 50 ml of a seed culture medium (pH 6.0) containing 4% D-sorbitol, 
0.4% yeast extract, 0.05% glycerol, 0.25% MgS0 4 -7H 2 0, 1.75% corn steep liquor, 0.1% 
urea, and 1.5% CaC0 3 in a 500 ml Erlenmeyer flask, and cultivated at 30°C with 180 rpm 
for 1 day on a rotary shaker. The seed culture thus prepared was used for inoculating 50 
ml of a main culture medium, which was composed of 12.0% L-sorbose, 2.0% urea, 
0.05% glycerol, 0.25% MgS0 4 *7H 2 0, 3.0% corn steep liquor, 0.4% yeast extract, and 
1.5% CaC03 in a 500 ml Erlenmeyer flask. The cultivation was carried out at 30°C and 
180 rpm for 4 days. The activity assay was performed as described in Example 4. The 
amount of 2-KGA produced as the enzyme activity was measured at a wavelength of 
340nm by a HPLC system that was composed with a UV detector (TOSOH UV8000; 
Tosoh), a dualpump (TOSOH CCPE; Tosoh), an integrator (Shimadzu C-R6A; 
Shimadzu) and a column (YMC-Pack Pro C18, YMC). As shown in Table 2, the 



composed of 30 g/1 of Trypticase Soy Broth (BBL; Becton Dickinson and Company, 



Proc. Natl. Acad. Sci. USA, 76, 1648-1652, 1979] and E. coli JM109 (pSUPSN::Km) were 
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production efficiency for 2-KGA of the SNDHrgene disruptants was higher than that of 
the parent strain GOMTR1. The difference of the conversion rate per mol L-sorbose to 
2-KGA was about 3%. 



Table 2: 2-KGA production of G. oxydans strains having a disrupted SNDH-gene 



Strain 


2-KGA 


Residual L-sorbose 


*Molar yield 




(g/L) 


(g/L) 


(mol %) 


GOMTRlSN::Km 


96.7 


15.3 


99.2 


GOMTR1 


98.8 


9.8 


95.5 



*Molar yield: mol 2-KGA produced/mol L-sorbose consumed. 



Example 6: Introduction of the plasmids carrying the SNDH gene into the SNDH- 
gene disruptant of G. oxydans DSM 4025 

10 Several kinds of SNDH-expression plasmids using broad host range vector pVKlOO 

were constructed as shown in Figure 5. Those plasmids have different insert DNAs at the 
Hind III site of pVKlOO described as follows: pVSNl 17 has the insert DNA containing 
the incomplete SNDH gene encoding a polypeptide ending at Gly535 of SEQ ID NO: 5 
(amino acid residue 566 of SEQ ID NO: 2), i.e., a C-terminal deleted SNDH gene, which 

15 expresses only a 55 kDa protein. Plasmids pVSN106 and pVSNl 14, respectively, have the 
insert DNA containing the complete SNDH gene. Those plasmids were introduced into 
strain GOMTRlSN::Km by conjugal transfer method. 

The transconjugants having the plasmids shown in Figure 5 were grown on an agar 
plate containing 10.0% L-sorbose, 0.25% MgS0 4 '7H 2 0, 1.75% corn steep liquor, 5.0% 

20 baker's yeast, 0.5% urea, 0.5% CaC0 3 , and 2.0% agar at 27°C for 4 days. The enzyme 
reaction mixture consisted of 80 ug of cell free extract of the recombinant Gluconobacter 
strains, 25 mM potassium phosphate buffer (pH 7.0), 50 mM of L-sorbosone, and 0.05 
mM of PMS. The enzyme reaction was carried out at 30°C for 30 min with shaking at 
1,000 rpm. The activity assay was performed according to Example 4. The result is 

25 shown in Table 3. 
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Table 3: production of vitamin C with different SNDH constructs 



Host cell 


Vector DNA 


Vitamin C produced 
(mg/L) 


GOMTRlSN::Km-2 


pVKlOO 


0.0 




pVSN117 


473.2 




pVSN106 


845.3 




pVSN114 


860.2 
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Claims 



1. An isolated nucleic acid molecule encoding aldehyde dehydrogenase which 
comprises a polynucleotide being at least 95% identical to the nucleotide sequence of 
SEQ ID NO: 1. 

2. An isolated nucleic acid molecule encoding aldehyde dehydrogenase which 
comprises a polynucleotide being at least 95% identical to the polynucleotide selected 
from the group consisting of (a) nucleotides 258-2084 of SEQ ID NO: 1, (b) nucleotides 
351-2084 of SEQ ID NO: 1, (c) nucleotides 258-1955 of SEQ ID NO: 1, and (d) 
nucleotides 351-1955 of SEQ ID NO: 1. 

3. An isolated nucleic acid molecule encoding aldehyde dehydrogenase which 
comprises a polynucleotide selected from the group consisting of (a) a polynucleotide 
encoding the polypeptide having the amino acid sequence of SEQ ID NO: 2, (b) a 
polynucleotide encoding the polypeptide consisting of amino acids 32-609 of SEQ ID 
NO: 2, (c) a polynucleotide encoding the polypeptide consisting of amino acids 1-566 of 
SEQ ID NO: 2, and (d) a polynucleotide encoding the polypeptide consisting of amino 
acids 32-566 of SEQ ID NO: 2. 

4. An isolated nucleic acid molecule encoding a polypeptide having aldehyde 
dehydrogenase activity, wherein said nucleic acid molecule hybridizes under standard 
conditions to the complementary strand of a nucleic acid molecule of any one of claims 1 
to 3. 

5. An expression vector which comprises the nucleic acid molecule of any one of 
claims 1 to 4. 

6. The expression vector of claim 5, wherein said vector is selected from a vector or a 
derivative thereof selected from the group consisting of pQE, pUC, pBluescript II, 
pACYC177, pACYC184, pVKlOO and RSF1010. 

7. A recombinant microorganism which is transformed with the expression vector of 
claim 5. 

8. A recombinant microorganism which comprises the nucleic acid molecule of any 
one of claims 1 to 4 integrated into its chromosomal DNA. 

9. The recombinant microorganism of claim 7 or 8, wherein said microorganism is 
selected from the group consisting of bacteria, yeast, and plant cells. 
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10. The recombinant microorganism of claim 9, wherein said microorganism is 
selected from the group consisting of Gluconobacter, Acetobacter, Pseudomonas, Klebsiella, 
Acinetobacter y and Escherichia. 

11. The recombinant microorganism of claim 10, wherein said microorganism is 
Gluconobacter oxydans DSM 4025. 

12. A process for the production of 2-keto-L-gulonic acid (2-KGA) and/or vitamin C 
from L-sorbosone comprising (a) cultivating the recombinant microorganism of claim 7 
or 8 in an appropriate culture medium, and (b) recovering and separating 2-KGA and/or 
vitamin C from said culture medium. 

13. A process for the production of 2-KGA from L-sorbosone comprising (a) 
cultivating a microorganism belonging to Gluconobacter oxydans DSM 4025 in an 
appropriate culture medium, wherein the gene encoding aldehyde dehydrogenase 
represented by SEQ ID NO: 2 is disrupted in said microorganism, and (b) recovering and 
separating 2-KGA from said culture medium. 

14. A process for the production of aldehyde dehydrogenase comprising (a) cultivating 
a recombinant microorganism comprising a nucleic acid molecule of any one of claims 1 
to 4 in an appropriate culture medium, and (b) recovering and separating said aldehyde 
dehydrogenase from said culture medium. 
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Sequence Listings 

<110> Roche Vitamins AG 

<12 0> A gene encoding aldehyde dehydrogenase and use thereof 

<130> 21424 

<140> 

<141> 

<160> 9 

<170> Patentln Ver. 3.1 



10 <210> 1 

<211> 3408 
<212> DNA 

<213> Gluconobacter oxydans 
<220> 
15 <221> CDS 

<222> (258) . . (2087) 

<220> 

<221> CDS 

<222> (2214) . . (3317) 

20 





<400> 1 














GCGACTGGCA 


GCAGCGCAAC 


TATGACCACT 


ATGGCCTGCC 


GCCCTATTGG 


50 




ATCTAACTGA 


TCCAGTAAGC 


CACCATCAGC 


CGGCCCCTGC 


GGGGGCCGGC 


100 




TTTTTGCGCT 


AGACCCCGCC 


GAGGTGCTGT 


CGTAACC TAA 


GGTCACATCT 


150 


25 


TTACTTCCAC 


ATCCGCCCTT 


GTCAGTTCTG 


AC GTGAC AAA 


TTGTCGCGGT 


200 




CATGCTGCTG 


AATGCGGATG 


CCAGTCCCAG 


ATCCAAGCCC 


GACGCAAGGA 


250 




GAC GTAGATG 


TTACCCAAAT 


CATTGAAACA 


TAAGAATGGC 


GCCATGCGCC 


300 




TTGTCGCAGC 


CTCGACCCTT 


GCGCTGATGA 


TCGGCGCGGG 


TGCCCATGCG 


350 




CAGGTAAACC 


CGGTCGAAGT 


GCCGGTGGGC 


GCGAACGAGA 


CCTTTACCTC 


400 


30 


GCGCGTGCTG 


ACCACCGGCC 


TGTCGAACCC 


TTGGGAAATC 


ACCTGGGGCC 


450 




CCGACAATAT 


GCTGTGGGTG 


AC CGAGCGAT 


CTTCCGGCGA 


AGTGACGCGC 


500 




GTCGACCCCA 


ATACCGGCGA 


GCAGCAGGTC 


CTGCTGACCC 


TGACCGATTT 


550 




CAGCGTCGAT 


GTGCAACACC 


AGGGCCTACT 


TGGCCTCGCG 


CTGCATCCTG 


600 




AGTTTATGCA 


AGAGAGCGGC 


AACGACTACG 


TCTATATCGT 


CTACACTTAT 


650 


35 


AACACCGGCA 


CCGAAGAAGC 


GCCCGATCCG 


CATCAAAAGC 


TGGTGCGTTA 


700 




TGCCTATGAC 


GCTGCCGCGC 


AGCAGCTGGT 


CGATCCGGTT 


GATCTGGTCG 


750 




CAGGCATTCC 


CGCAGGCAAC 


GACCACAATG 


GCGGTCGCAT 


CAAATTCGCC 


800 




CCCGATGGCC 


AACACATCTT 


TTACACGCTG 


GGCGAGCAAG 


GCGCGAACTT 


850 




TGGCGGTAAC 


TTCCGCCGTC 


CGAACCACGC 


GCAACTGCTG 


CCGACGCAAG 


900 


40 


AGCAGGTCGA 


CGCGGGCGAT 


TGGGTCGCCT 


ATTCGGGCAA 


GATCCTGCGC 


950 
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GTGAACCTTG ACGGCACGAT CCCCGAAGAC AACCCCGAGA TCGAGGGCGT 1000 

GCGTAGCCAT ATCTTTACCT ATGGCCACCG TAACCCGCAG GGCATCACCT 1050 

TTGGCCCCGA CGGCACCATT TATGCCACCG AACACGGCCC CGATACGGAT 1100 

GACGAGCTGA ACATCATCGC CGGCGGTGGC AACTATGGGT GGCCGAATGT 1150 

5 GGCCGGCTAT CGCGATGGCA AATC CTATGT CTACGCTGAT TGGAGCCAAG 1200 

CGCCCGCTGA CCAGCGTTAC ACCGGTCGCG CCGGTATCCC CGACACCGTG 1250 

CCGCAATTCC CCGAGCTGGA ATTCGCGCCC GAGATGGTCG ATCCGCTGAC 13 00 

AACCTATTGG ACGGTGGATA ATGATTACGA TTTCACCGCC AATTGCGGCT ^1350 

GGATCTGTAA TCCGACGATC GCGCCTTCGT CTGCCTATTA CTATGCGGCG 1400 

10 GGCGAGAGCG GTATCGCGGC TTGGGATAAT TCGATCCTGA TCCCGACGCT 1450 

GAAACATGGC GGCATCTATG TGCAGCACCT CAGCGATGAT GGCCAATCTG 1500 

TCGACGGCCT GCCCGAGCTG TGGTTCAGCA CCCAGAACCG CTATCGCGAT 1550 

ATCGAGATCA GCCCCGATAA CCATGTTTTT GTGGCGACCG ACAACTTTGG 1600 

CACCTCGGCG CAGAAATATG GCGAGACCGG CTTTACCAAC GTGCTGCATA 1650 

15 ACCCCGGCGC GATCCTTGTC TTTAGCTATG TCGGCGAGGA TGCTGCGGGT 1700 

CAGACCGGAA TGATGACCGC GCCCGCACCG CAGACGCAAT ACACGCAAGT 1750 

GCCCGCCGAG GGTGCAGGCG CGGGCGCGAC TGAGGTTGCG GATGTCGATT 1800 

ACGACACGCT GTTCACCGAA GGCCAGACCC TTTATGGCAG CGCATGTGCC 1850 

GCGTGCCATG GTGCCGCTGG CCAAGGTGCG CAGGGCCCGA CCTTTGTGGG 1900 

20 CGTGCCGGAT GTGACGGGTG ACAAGGACTA CCTTGCCCGC ACCATCATCC 1950 

AC GGTTTTGG CTATATGCCG TCGTTTGCGA CTCGGCTGGA TGACGAGGAG 2000 

GTTGCCGCCA TCGCGACCTT TATCCGCAAC AGCTGGGGCA ATGACGAAGG 2 050 

CATCCTGACC CCGGCCGAGG CCGCTGCCAC CCGCTGAATG CTGTAAAAAC 2100 

CACCCTCGCC TGCACATCAG GCGGGGGTAT TTCATTTATT TTCACATCTG 2150 

25 CCTTTGACAT GTGCCGCTAT CACGGTTAAT GCGGCCCTTC GGCTGTTCTG 22 00 

GGTCTAAGCG GGTGTGTTGC CCGATAAGAG AGACGGTTCA GTCCCTCCCG 2250 

CCCTATTTAG GGCCCATTTA GGCAGAATAG TTTTGACTCA TCAAAATATC 2300 

GCCGCGCCTC TGGCCGCGGC CCTTTCGCAA CGTGGATATG AAACGCTGAC 2350 

CGCCGTGCAG CAAGCTGTGC TTGCGCCCGA GGCTGATGGC CGCGACCTGC 2400 

30 TGGTGTCGGC ACAGACCGGT TCGGGTAAGA CGGTGGCCTT TGGTATCGCA 2450 

GTCGCGCCCG ACCTTTTGGG CGACGACAAT ATCCTGCCGC TGAACACGCC 2500 

GCCTGTTGCG CTGTTCATCG CCCCCACGCG CGAGCTTGCG CTGCAAGTTG 2550 

CTCAGGAACT GACCTGGCTT TACGCCAATG CAGGTGCCCA GATCGCGACC 2 600 

TGCGTCGGCG GTATGGATTA CCGCACCGAG CGCCGCGCCC TTGCACGTCT 2 650 

35 GCCGCAAATC GTTGTCGGCA CGCCCGGCCG TCTGCGCGAC CATATCGACC 2700 

GTGGCGGCCT TGACCTGTCC GAATTGC GCG TGACCGTGCT GGACGAAGCG 2750 

GATGAGATGC TCGACCTCGG CTTCCGCGAT GATCTGCAAT ATATCTTGCA 2800 

AGCCGCGCCC GAAGATCGCC GCACGCTGAT GTTCTCGGCC ACCGTGCCGC 2850 

GCGAGATTGA AAAACTGGCC CGCGACTTCC AAAATGACGC CCTGCGTCTG 2900 

40 GAAACCCGTG GCGAGGCCAA GCAGCACAAC GACATCAGCT ACCAAGCTTT 2950 

GTCGGTCACC ATGCGCGATC GCGAAAACGC CATTTTCAAC ATGCTGCGTT 3000 
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TTTATGAATC GCGCACGGCG ATCATCTTCT GCAAGACCCG CGCCAATGTG 3 050 
AATGATCTGC TGTCGCGGAT GAGCGGTCGT GGCTTCCGCG TGGTGGCCCT 3100 
GTCGGGCGAG CTGTCGCAAC AGGAACGCAC CAACGCGCTG CAAGCGCTGC 3150 
GTGATGGCCG CGCCAACGTT TGTATCGCGA CCGACGTCGC GGCGCGCGGC 3200 
5 ATTGACTTGC CGGGCCTCGA GCTGGTGATC CACTACGATC TGCCGACCAA 3250 
TGCCGAAACC CTGCTGCACC GCTCGGGCCG TACCGGCCGC CGGGTGCCAA 3300 
GGGCGTCTCG GCGCTGATCG TCACCCCCGG CGATTTCAAA AAAGCGCAGC 3350 
GTTTGCTGAG CTTTGCCAAA GTGACCGCGG AATGGGGCAA GGCGCCTTCG 3400 
GCCGAAGA 3408 



<210> 2 
<211> 609 
<212> PRT 
15 <213> Gluconobacter oxydans 
<220> 

<221> SIGNAL 
<222> (1) . . (31) 
<220> 
20 <221> CHAIN 

<222> (32) . . (609) 

<400> 2 

Met Leu Pro Lys Ser Leu Lys His Lys Asn Gly Ala Met Arg Leu 15 

25 

Val Ala Ala Ser Thr Leu Ala Leu Met He Gly Ala Gly Ala His 30 

Ala Gin Val Asn Pro Val Glu Val Pro Val Gly Ala Asn Glu Thr 45 

30 Phe Thr Ser Arg Val Leu Thr Thr Gly Leu Ser Asn Pro Trp Glu 60 

He Thr Trp Gly Pro Asp Asn Met Leu Trp Val Thr Glu Arg Ser 75 

Ser Gly Glu Val Thr Arg Val Asp Pro Asn Thr Gly Glu Gin Gin 90 

35 

Val Leu Leu Thr Leu Thr Asp Phe Ser Val Asp Val Gin His Gin 105 

Gly Leu Leu Gly Leu Ala Leu His Pro Glu Phe Met Gin Glu Ser 12 0 

40 Gly Asn Asp Tyr Val Tyr He Val Tyr Thr Tyr Asn Thr Gly Thr 135 



10 



Glu Glu Ala Pro Asp Pro His Gin Lys Leu Val Arg Tyr Ala Tyr 150 
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Asp Ala Ala Ala Gin Gin Leu Val Asp Pro Val Asp Leu Val Ala 165 
Gly lie Pro Ala Gly Asn Asp His Asn Gly Gly Arg lie Lys Phe 180 

5 

Ala Pro Asp Gly Gin His lie Phe Tyr Thr Leu Gly Glu Gin Gly 195 
Ala Asn Phe Gly Gly Asn Phe Arg Arg Pro Asn His Ala Gin Leu 210 
10 Leu Pro Thr Gin Glu Gin Val Asp Ala Gly Asp Trp Val Ala Tyr 225 
Ser Gly Lys lie Leu Arg Val Asn Leu Asp Gly Thr lie Pro Glu 240 
Asp Asn Pro Glu lie Glu Gly Val Arg Ser His lie Phe Thr Tyr 255 

15 

Gly His Arg Asn Pro Gin Gly lie Thr Phe Gly Pro Asp Gly Thr 270 
lie Tyr Ala Thr Glu His Gly Pro Asp Thr Asp Asp Glu Leu Asn 285 
20 lie lie Ala Gly Gly Gly Asn Tyr Gly Trp Pro Asn Val Ala Gly 300 
Tyr Arg Asp Gly Lys Ser Tyr Val Tyr Ala Asp Trp Ser Gin Ala 315 
Pro Ala Asp Gin Arg Tyr Thr Gly Arg Ala Gly He Pro Asp Thr 330 

25 

Val Pro Gin Phe Pro Glu Leu Glu Phe Ala Pro Glu Met Val Asp 345 
Pro Leu Thr Thr Tyr Trp Thr Val Asp Asn Asp Tyr Asp Phe Thr 3 60 
30 Ala Asn Cys Gly Trp He Cys Asn Pro Thr He Ala Pro Ser Ser 375 
Ala Tyr Tyr Tyr Ala Ala Gly Glu Ser Gly He Ala Ala Trp Asp 390 
Asn Ser He Leu He Pro Thr Leu Lys His Gly Gly He Tyr Val 405 

35 

Gin His Leu Ser Asp Asp Gly Gin Ser Val Asp Gly Leu Pro Glu 420 
Leu Trp Phe Ser Thr Gin Asn Arg Tyr Arg Asp He Glu He Ser 435 
40 Pro Asp Asn His Val Phe Val Ala Thr Asp Asn Phe Gly Thr Ser 450 
Ala Gin Lys Tyr Gly Glu Thr Gly Phe Thr Asn Val Leu His Asn 465 
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Pro Gly Ala lie Leu Val Phe Ser Tyr Val Gly Glu Asp Ala Ala 480 
Gly Gin Thr Gly Met Met Thr Ala Pro Ala Pro Gin Thr Gin Tyr 495 

5 

Thr Gin Val Pro Ala Glu Gly Ala Gly Ala Gly Ala Thr Glu Val 510 
Ala Asp Val Asp Tyr Asp Thr Leu Phe Thr Glu Gly Gin Thr Leu 525 
10 Tyr Gly Ser Ala Cys Ala Ala Cys His Gly Ala Ala Gly Gin Gly 540 
Ala Gin Gly Pro Thr Phe Val Gly Val Pro Asp Val Thr Gly Asp 555 
Lys Asp Tyr Leu Ala Arg Thr lie He His Gly Phe Gly Tyr Met 570 

15 

Pro Ser Phe Ala Thr Arg Leu Asp Asp Glu Glu Val Ala Ala He 585 
Ala Thr Phe He Arg Asn Ser Trp Gly Asn Asp Glu Gly He Leu 600 
20 Thr Pro Ala Glu Ala Ala Ala Thr Arg 609 

<210> 3 
<211> 14 
25 <212> PRT 

<213> Gluconobacter oxydans 

<400> 3 

Gin [Xaa/Gly] Asn [Pro/Lys] Val Glu Val Pro Val Gly Ala Asn Glu Thr 

30 

, <210> 4 
<211> 31 
<212> PRT 
35 <213> Gluconobacter oxydans ' 

<220> 

<221> SIGNAL 
<222> (1) . . (31) 

40 

<400> 4 
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Met Leu Pro Lys Ser Leu Lys His Lys Asn Gly Ala Met Arg Leu 15 
Val Ala Ala Ser Thr Leu Ala Leu Met lie Gly Ala Gly Ala His 30 
5 Ala 31 

<210> 5 
<211> 578 
10 <212> PRT 

<213> Gluconobacter oxydans 

<220> 

<221> CHAIN 
15 <222> (1) . . (578) 

<400> 5 

Gin Val Asn Pro Val Glu Val Pro Val Gly Ala Asn Glu Thr Phe 15 

20 Thr Ser Arg Val Leu Thr Thr Gly Leu Ser Asn Pro Trp Glu lie 3 0 

Thr Trp Gly Pro Asp Asn Met Leu Trp Val Thr Glu Arg Ser Ser 45 

Gly Glu Val Thr Arg Val Asp Pro Asn Thr Gly Glu Gin Gin Val 60 

25 

Leu Leu Thr Leu Thr Asp Phe Ser Val Asp Val Gin His Gin Gly 75 

Leu Leu Gly Leu Ala Leu His Pro Glu Phe Met Gin Glu Ser Gly 90 

30 Asn Asp Tyr Val Tyr He Val Tyr Thr Tyr Asn Thr Gly Thr Glu 105 

Glu Ala Pro Asp Pro His Gin Lys Leu Val Arg Tyr Ala Tyr Asp 120 

Ala Ala Ala Gin Gin Leu Val Asp Pro Val Asp Leu Val Ala Gly 135 

35 

He Pro Ala Gly Asn Asp His Asn Gly Gly Arg lie Lys Phe Ala 150 

Pro Asp Gly Gin His He Phe Tyr Thr Leu Gly Glu Gin Gly Ala 165 

40 Asn Phe Gly Gly Asn Phe Arg Arg Pro Asn His Ala Gin Leu Leu 180 
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Pro Thr Gin Glu Gin Val Asp 
Gly Lys lie Leu Arg Val Asn 
5 Asn Pro Glu lie Glu Gly Val 
His Arg Asn Pro Gin Gly He 
Tyr Ala Thr Glu His Gly Pro 

10 

He Ala Gly Gly Gly Asn Tyr 
Arg Asp Gly Lys Ser Tyr Val 
15 Ala Asp Gin Arg Tyr Thr Gly 
Pro Gin Phe Pro Glu Leu Glu 
Leu Thr Thr Tyr Trp Thr Val 

20 

Asn Cys Gly Trp He Cys Asn 
Tyr Tyr Tyr Ala Ala Gly Glu 
25 Ser He Leu He Pro Thr Leu 
His Leu Ser Asp Asp Gly Gin 
Trp Phe Ser Thr Gin Asn Arg 

30 

Asp Asn His Val Phe Val Ala 
Gin Lys Tyr Gly Glu Thr Gly 
35 Gly Ala He Leu Val Phe Ser 
Gin Thr Gly Met Met Thr Ala 
Gin Val Pro Ala Glu Gly Ala 

40 

Asp Val Asp Tyr Asp Thr Leu 
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Ala Gly Asp Trp Val Ala Tyr Ser 195 
Leu Asp Gly Thr He Pro Glu Asp 210 
Arg Ser His He Phe Thr Tyr Gly 225 
Thr Phe Gly Pro Asp Gly Thr He 240 
Asp Thr Asp Asp Glu Leu Asn He 255 
Gly Trp Pro Asn Val Ala Gly Tyr 270 
Tyr Ala Asp Trp Ser Gin Ala Pro 285 
Arg Ala Gly He Pro Asp Thr Val 300 
Phe Ala Pro Glu Met Val Asp Pro 315 
Asp Asn Asp Tyr Asp Phe Thr Ala 330 
Pro Thr He Ala Pro Ser Ser Ala 345 
Ser Gly He Ala Ala Trp Asp Asn 3 60 
Lys His Gly Gly He Tyr Val Gin 375 
Ser Val Asp Gly Leu Pro Glu Leu 390 
Tyr Arg Asp He Glu He Ser Pro 405 
Thr Asp Asn Phe Gly Thr Ser Ala 420 
Phe Thr Asn Val Leu His Asn Pro 435 
Tyr Val Gly Glu Asp Ala Ala Gly 450 
Pro Ala Pro Gin Thr Gin Tyr Thr 465 
Gly Ala Gly Ala Thr Glu Val Ala 480 
Phe Thr Glu Gly Gin Thr Leu Tyr 495 
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Gly Ser Ala Cys Ala Ala Cys His Gly Ala Ala Gly Gin Gly Ala 510 
Gin Gly Pro Thr Phe Val Gly Val Pro Asp Val Thr Gly Asp Lys 525 

5 

Asp Tyr Leu Ala Arg Thr He He His Gly Phe Gly Tyr Met Pro 540 
Ser Phe Ala Thr Arg Leu Asp Asp Glu Glu Val Ala Ala He Ala 555 
10 Thr Phe He Arg Asn Ser Trp Gly Asn Asp Glu Gly He Leu Thr 570 
Pro Ala Glu Ala Ala Ala Thr Arg 578 

15 <210> 6 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
20 <220> 

<223> an artificially synthesized primer sequence 



<210> 7 
<211> 17 
<212> DNA 

<213> Artificial Sequence 



<223> an artificially synthesized primer sequence 
<220> 

35 <221> misc_feature 

<222> 9 

<223> n is a or g or c or t 



<400> 6 



carggyaacc csgtbga 



17 



30 



<220> 



40 



<400> 7 

gtytcgttng crccvac 



17 
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<210> 8 

<211> 15 

<212> D3SEA 

5 <213> Artificial Sequence 

<220> 

<223> an artificially synthesized primer sequence 

10 <400> 8 

cagggtaacc cggtc 

<210> 9 
15 <211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

20 <223> an artificially synthesized primer sequence 



<400> 9 

gactcgtttg cgccc 
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} BUDAPEST TREATY ON THE INTERNATIONAL 

^TCOCNITION OF THE DEPOSIT OF MICROORGANISMS 
FOR THE PURPOSES OF PATENT PROCEDURE 



INTERNATIONAL FORM 

The Oriental Scientific 
Instruments, Import and Export 

Corp. for Inst, of Microbiology 
Academia Sinica 
52 San-Li-He Rd. 
Beijing, P.R. China 



L DEPOSITOR 


IL IDENTIFICATION OF THE MICROORGANISM 


Name: ^ Q Oriental Scientific 

Instmments, Import and Export 
Address: Corp for Inst of Microbiology 

Academia Sinica 

52 San-Li-He Rd. 

Beijing, P.R. China 


Accession number given by the 
INTERNATIONAL DEPOSITARY AUTHORITY: 

DSM 4025 
Date of the deposit or the transfer 1 : 

1987-03-17 


ni. VIABILITY STATEMENT 



The viability of the microorganism identified under II above was tested on 2003-08-13 
On that date, the said microorganism was ^ 



( x y viable 

( ) 3 no longer viable 



IV. CONDITIONS UNDER WHICH THE VIABILITY TEST HAS BEEN PERFORMED 4 



VIABILITY STATEMENT 
issued pursuant to Rule 10.2 by the 
INTERNATIONAL DEPOSITARY AUTHORITY 
identified at the bottom of this page 



V. INTERNATIONAL DEPOSITARY AUTHORITY 



Name: 



DSMZ-DEUTSCHE SAMMLUNG VON* 
MIKROORGANISMEN UND ZELLKULTUREN GmbH 



Address: Maschcroder Wcg lb 

D-38124 Braunschweig 



Signature^) of person(s) having the power to represent the 
International Depositary Authority or of authorized officials): 

Date: 2003-08-15 



8 Indicate the date of original deposit or, where a new deposit or a transfer has been made, the most recent relevant date (date of the new deposit or date 
of the transfer). 

2 In the cases referred to in Rule 10.2(a) (ii) and (iii), re fcr to the most recent viability test. 
s Mark with a cross the applicable box. 

4 Fill in if the information has been requested and if the results of the test were negative. 
Form DSMZ-BP/9 (sole page) 12/2001 



